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Abstract — In this article we explore a modification in tiie 
problem of controlling the rotation of a two level quantum 
system from an initial state to a final state in minimum time. 
Specifically we consider the case where the qubit is being weakly 
monitored - albeit with an assumption that both the measure- 
ment strength as well as the angular velocity are assumed 
to be control signals. This modification alters the dynamics 
significantly and enables the exploitation of the measurement 
backaction to assist in achieving the control objective. The 
proposed method yields a significant speedup in achieving the 
desired state transfer compared to previous approaches. These 
results are demonstrated via numerical solutions for an example 
problem on a single qubit. 

I. Introduction 

One of the requirements common to many applications of 
quantum engineering [1], [2], [3], is that of being able to 
create a quantum system in a particular pure quantum state. 
Moreover, it is desirable to achieve such a state value in as 
short a time as possible. This has motivated research into 
this domain of time optimal control of quantum systems; 
Although has been a significant amount of research on 
optimal control for closed quantum systems [4], [5], [6], [7], 
[8], the investigation into the optimal control of monitored 
open quantum systems still remains to be developed fully 
[9], [10], [11], [12], [13]. 

In this work we address the problem of transferring the 
state of a two level quantum system from an initial pure state 
to a final pure state. In particular develop the idea proposed 
in [14] on the possible use of measurement backaction for 
speeding up the control of a qubit to a greater extent that 
that achievable by fixed measurement alone. The framework 
for this problem is unique due to the fact that both the 
angular rotation as well as the measurement strength for the 
continuous measurement being performed are control signals 
that can be varied to achieve the desired objective. This 
leads to a significant speedup in the time required to reorient 
the system between two pure states. Intuitively we use the 
fundamental property of measurement backaction, unique to 
quantum systems, to help achieve the state transition more 
rapidly than that possible using either: static measurement 
and feedback or that with no-measurement and a control 
driven by a constant rotation (termed the Hamiltonian evo- 
lution). 

The structure of this article is as follows. In section |ll] 
we describe the system model and frame the optimal control 



problem of interest. To solve the problem described above, 
we utilize the dynamic programming approach from optimal 
control theory in section [III] It turns out however, that the 
Hamilton- Jacobi-Bellman equations associated with this con- 
trol problem may not have classical solutions. We discuss this 
issue and indicate a numerical method to obtain the solution 
in section III Two cases of interest in the optimal control 



of the state of a quantum system are transition between 
states which are: (a) parallel or (b) orthogonal to the axis of 
measurement. The solution to the control problem for these 



different cases are described in section IV We then highlight 
the performance of the dynamic measurement and control 
strategy introduced herein with respect to that achieved using 
either only static measurement strength or with pure rotation 
alone (with no measurement). We conclude with a discussion 
of further interesting questions that arise from these results, 
in section IVTl 

II. System Description and further background 
A. The model 

We now explain the model for the system of interest - the 
continuous measurement and feedback control of a single 
qubit. A more comprehensive introduction to continuous 
quantum measurements can be found in [15], [16]. The 
state of such a system may be represented by a vector in 
a 3 dimensional real unit sphere (termed the Block sphere). 

Consider a quantum spin system subject to measurement 
along the z axi^ with control signals that induce a rotation 
around the three axis. Let 7 denote the measurement strength 
- a parameter that determines the rate at which the informa- 
tion is extracted from a system. A larger value of 7 leads 
to a greater rate of information extraction and therefore a 
higher rate at which the system is projected on to one of 
the eigenstates of the observable used [in this case it is the 
Pauli matrix a, diag(l, — 1)]. For the particular choice of 
measurement considered herein, these eigenstates correspond 
to the up (+z) direction and the down (— z) direction on the 
Bloch sphere. The goal of the feedback is to take the initial 
state (say |0) i.e., the +z direction) and control it to the 
orthogonal state (say |1) i.e., the —z direction) in a time 
optimal manner We assume that: 

1) the initial state is pure and that measurements are 
efficient. This implies that the evolution of the state 
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i.e., this measurement is of tlie observable a,. 



vector is confined to the surface of the Bloch sphere. 
2) the available control is equal in strength (isotropic) 
about all 3 axes. 

Under the above assumptions, by using the symmetry of the 
problem the Bloch representation reduces (after a change of 
variables) to a lower dimensional model; Herein the state is 
constrained to lie on the unit circle i.e., the x-z plane. The 
control signal causes the state to rotate around the y axis. 
The control problem for this system involves moving the 
state between any two states on the unit circle. 

The system described above is modeled using a stochastic 
differential equation (SDE) of the form 

ddit) = a{t) dt - 27(f) sin(20(f))c/f - 2^2y{t) sin(0(f))c/W. 

(1) 

where Q E [—k,k]. The term a denotes the control signal 
applied. To ensure that the control problem is well posed 
we apply a bounded strength control, i.e., the controls are 
constrained to a closed compact set V := [— Here 
the maximum and minimum values that the control signal 
can take up at any time instant are symmetric and have a 
magnitude £1. We denote the set of piecewise continuous 
angular velocity signals by the term Y. 

The second term in the equation above is the quantum 
measurement backaction. This can be intuitively understood 
by setting the angle to or ±7t, so the state vector and 
the measurement axis commute and the backaction goes 
to zero. Similarly at the point 9 — ±7t/2 where the state 
and measurement axes are maximally non-commuting, the 
measurement back-action is largest. The term 7(f) denotes 
the measurement strength which is also a control term in this 
problem. The values allowed for the measurement strength 
belong to the range [0,r]. The set of measurement strength 
signals is denoted by 3. The final term in Eq. ([TJ indicates 
the innovation term arising from measurements. 

We denote the control signal pair (a, 7) as rj and the 
set from which this signal is drawn as S := ^ x 3. The 
solution to the SDE ([TJ for a trajectory, starting at a point 
00 (at a time to) and using a control strategy T] e E, at a 
time instant t G [to,°°) is denoted by 0(f ; Tj,fo, 0o)- Note that 
the this trajectory is a particular sample path of a stochastic 
process. In cases where the arguments used in this notation 
are clear from the context we represent the solution at time 
t by 9,. 

B. Performance measure: the expected (discounted) hitting 
time 

In order to capture the time optimality requirement of the 
problem, the most intuitive approach is to model the problem 
as an optimal control problem with a cost function that 
measures the expected hitting time to the target set (denoted 
by 3^). Define the hitting time as 
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Fig. 1. The Bloch sphere with a graphical depiction of our control problem. 
We start in the plus eigenstate of the observable a- and rotate to the 
orthogonal state — 



The cost function based on the expected hitting time will 
take the form 
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(3) 
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4(0o)-inf{f|0(f;T7,fo,0o)e^}, 



(2) 



However, as described in the next section, application of 
the dynamic programming principle to solve this optimal 
control problem leads to an associated Hamilton-Jacobi- 
Bellman (HJB) equation whose classical solution may not 
exist. Thus we use a revised discounted cost function which 
ensures the uniqueness a weak solution in similar classes of 
optimal control problems (c.f. [17], [18]) 

C(0o):=infEM exp{-j3i}t/i L (5) 

The parameter )3 > is called the discount factor. One 
interesting aspect of this discounting is that the cost function 
remains bounded for any choice of the control signal. The 
motivation for and advantages of this discounting will be 
discussed in detail below. 

III. Optimal control for the hitting time 

PROBLEM 

In order to obtain the optimal cost function (Eq. (|5|) and 
the corresponding control strategy for the system of interest, 
we apply the dynamic programming [19], [20] approach from 
optimal control theory. 

Note that the system dynamics in Eq. Q is an SDE of 
the form 

dx^\}{x,a,y)dt + a{x,y)dW. (6) 

By comparison, the coefficients b, (J can be seen to be 

b(;c,a,7) :=a-27sin(20), (7) 
^ (8) 



a{x,y) :=2^27sin(0) 



i.e., for each sample path it is the first time at which the 
trajectory reaches the target set (and is a random variable). 



-We defer the discussion of the weak solutions and a rigorous justification 
of the discounted cost in this problem to the future. 



We introduce a differential operator L''[0](y) given by the 
expression 



d(j) 



(9) 



which is the generator of the It5 diffusion process Eq.([T]i. The 
application of dynamic programming to this optimal con- 
trol problem yields the following Hamilton-Jacobi-Bellman 
equation over the set G (— Tf, n): 

sup{-l+j3^-Ll[0](y)} = O, VyeG (10) 

7)GZ 

with boundary conditions 



(11) 



The classical solution to this partial differential equation 
(PDE) yields the discounted cost function in Eq. (|5]l. 

Note that the HJB equation ([TO]l is an elliptic PDE [21] 
with a coefficient for the second order derivative that can 
become zero at any point in the domain G - therefore it 
is called a degenerate elliptic PDE. The positivity (non- 
degeneracy) of this second order term is a sufficient condition 
for the existence of a classical solution to this PDE [22], 
[17]. Hence, due to the nature of the C7(-, ) term in the 
system dynamics being able to take up a value of zero, 
there arises a degeneracy owing to which the HJB equation 
is not guaranteed to have a sufficiently smooth solution. 
Therefore a rigorous study of the solution to the optimal 
control problem necessitates an analysis of the solution to 
this equation in a weak sense. 

It is interesting to note that an alternate approach used in 
the literature to determine the hitting time involves solving 
a PDE termed the Fokker-Planck equation [23], [24] . The 
solution to this equation is the probability density of the 
distribution of the hitting time (from which we can evaluate 
the expectation of the hitting time). The degeneracy indicated 
above also arises naturally in the Fokker-Planck equation, 
thereby giving rise to the same issues of non-existence of 
classical solutions. 

In this article our focus is on obtaining and analyzing the 
optimal control strategy and the improvement obtained in the 
time optimality in state transfer compared to that achieved 
by other strategies. Hence we defer the analysis of these 
questions of existence and uniqueness of the generalized 
solutions for this problem to a future publication. 

A. Numerical solution 

One widely applicable method for computing the solution 
to optimal control problems is the Markov chain approx- 
imation method [25], [17]. In this approach the system 
dynamics are approximated by a controlled Markov chain on 
a finite state space. The cost function is then approximated 
by a discretization suited to this chain. Thus an iteration 
is constructed which converges to the desired cost function 
under the limit that the discretization converges towards the 
original formulation. For a more detailed introduction to this 
approach and other applications to quantum control we refer 



the reader to [25], [26], [18] and the references therein. We 
now outline the iterative procedure to solve for the cost 
function. 
Define 



max{a,0}, a max{— fl,0}. 



(12) 



We denote the spatial discretization interval by 'h\ In 
addition, we use the expression 

1 



exp{-j3Af} : 



l+j3Ar^ 



(13) 



to approximate the exponential weighting term. We generate 
a grid G^' that approximates the set G (for instance using 
a mesh with step-size h). The discretization for the HJB 
equation ( [TO] i yields 



</)'\x) =min ^/(x,y|T7)0''(y)+Af"(x,T7) 



l+j3Af/'(x,T7) 



x£G (14) 



where the summation is over all points y neighboring x. The 
terms in the equation above are functions that are given 
by [25]. 

(7^ {x, y) /2 + hb+ (x,a,Y) 



p"{x,x + h\ri) 
p'\x,x-h\ri) 
p''{x,x\ri) 



(15) 

a*^{x)+hB*{x) ' ^^^^ 
[a*^{x) - 7)] + hB* (x) - h\b{x, a, y) | 



a*^{x)+hB*{x) 
(x, 7) /2 + hh^ (x, a, 7) 



(7*2(x)+/lB*(x) 



(17) 



where 



B*(x) := a + 2r|i!n(2x)|, a*(x) := 2^2Ysin{Q) 



At'\x,r]) 



a*2(x) + /iB*(x) 



Denoting the RHS of Eq. 14 as an operator 



the value function 0(-) we obtain the iteration 

(/./,Vi(x)=^(</.i')(x), XGG", 



(18) 
(19) 

acting on 
(20) 



Under appropriate choices of parameters, this operator £, can 
be shown to be a contraction mapping, thereby yielding the 
necessary convergence. The results obtained by applying this 
iterative method to the problem of interest will be described 
in the following section. 

IV. Numerical Examples 

In this section we describe the solution to the optimal 
control problem for two cases of interest: 

1) when the initial and final states for the control problem 

are eigenstates of the observable cr^ i.e., to move from 

|0) to |1) (ref. Fig.[l}. 
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Fig. 2. The Bloch sphere with a graphical depiction of our control problem. 
We start in the plus eigenstate of the observable CT,- and rotate to the 
oilhogonal state —\x). 



2) when the states are both maximally non-commuting 
with respect to the measurement (ref. Fig. [2]) with a; 
i.e., the problem is to go from +\x) to — \x). 
These two cases are of interest since they help clarify 
whether the control problem between two orthogonal states 
depends on the nature of the initial and terminal points. 
The optimal control problems are solved numerically via the 
value iteration approach described in the previous section. 

In order to implement this approach we include a stop- 
ping criteria for the value iteration algorithm, which in our 
approach is obtained from a stopping test function - the 
maximum absolute value of the change in the cost function 
over all grid points. Once this goes below a fixed threshold, 
we stop the value iteration. Note that this is possible due to 
the fact that the value iteration operator ^ is a contraction 
mapping (if not, there would be no reason for this stopping 
test function to remain below the threshold in subsequent 
operations). 

A. Optimal transition between eigenstates 

In this case the states take up values from the set G := 
(— ;r,+;r). Assume a control bound of i2 = 5, a discount 
factor of )3 — 0. 1 and a measurement strength of F = 1 . Ap- 
plying the value iteration procedure, we obtain the solution 
to the HJB PDF ( [TOj i subject to the boundary condition of 
C(±7r) = 0. The result obtained is as indicated in F ig. [3 the 
corresponding optimal control is as shown in Fig. 4(a) and 



measurement strength is indicated in Fig. 4(b) Hence it turns 
out that the optimal control is consistent with the intuition of 
exercising a clockwise rotation when starting at any point to 
the right of the (up) state and counterclockwise rotation 
to the left of this state. Note that the optimal measurement 
strategy in this case is to turn off measurement tiU arriving 
at the state = 7r/2. 

B. Optimal transition between non- eigenstates 

We now study the optimal control problem of taking 
the initial state Qq ~ n/2 to the target state of = 
{ — ;r/2,3;r/2} as depicted in Fig. |2] In this case the region 
of interest is G :— {—n/2,3n/2) and the HJB equation as- 
sociated with this problem is ( [TO) i with a boundary condition 
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Fig. 3. Discounted hitting time cost function to the target :— {—Tl.Tl}, 
starting from various possible initial states with H — 5, /3 — 0.1 and F — 1. 
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(a) Optimal angular velocity control signal to the target^ :— {—7l,7t}, 
starting from various possible initial states with H — 5, /3 — 0.1 and F — 1 
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(b) Optimal measurement strength signal to the target := {—7Z,7c}, 
starting from various possible initial states with = 5, j3 = 0.1 and F = 1. 

Fig. 4. Transition between eigenstates. 



of C{^„e) — 0. The value iteration approach for this problem 
yields the solution (ref. Fig. |5]). This control strategy in 
this case involves an angular rotation of +Q. for all states 
corresponding to angles between (;r/2,3;r/2) and counter- 
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Fig. 5. Discounted hitting time cost function to the target<%f := 
{— ;r/2, 3;r/2}, starting from various possible initial states with £1 = 5, 
/3 = 0.1 and r= 1. 



clockwise control for states in the domain (— ;i;/2, 7i:/2). 

V. Interpretation 

In this section we analyze and compare the performance 
of the variable measurement strength approach proposed 
herein to two alternate control approaches - pure Hamiltonian 
rotation and fixed strength control. The strategy used as the 
base line for these comparisons is the case of pure rotation 
with no measurement. We now analyze the salient features 
of these strategies. 

A. Fixed measurement strength 

For the case of rotation between eigen states, from Fig. [3] 
it can be seen that for angles from [0,7r/2) the fixed 
measurement strength strategy performs worse than the pure 
rotation approach; However for angles between {n/2^n) 
the backaction from the fixed measurement helps project 
the state towards the target thereby outperforming the pure 
rotation strategy. This is in contrast to the control between 
non-eigen states ±|x) where the fixed measurement strategy 
performs better than the fixed rotation close to the starting 
point but worse for all points further away (except at the 
target). 

B. Dynamic measurement strength 

As can be observed from Figures [3] |5] the time for the 
dynamic measurement approach is always smaller than that 
for both: the pure rotation strategy (in fact they are equal only 
at the boundary), and for the case of a static measurement. 
For Q e [0,;r/2] the time to reach d — n is substantially 
different. Hence using a dynamic control and measurement 
scheme shortens the hitting time to the desired state by 
a significant margin. In the case of the transfer between 



the states orthogonal to |z) (i.e., \x)) we have that this 
strategy does better than the pure rotation strategy only for 
starting points from (0, k) and equal to the rotation strategy 
at all other points. This is intuitive as the the dynamic 
measurement strategy leads to the measurement signal being 
switched off for points between (— 7i;/2,0) and {n,3n/2) - 
leading to the use of only the maximum magnitude of the 
available angular rotation. 

VI. Conclusion and Future Work 

In this article we described an approach for the time 
optimal rotation of a quantum two level system. The dynamic 
measurement and velocity control strategy led to a speedup in 
the hitting time compared to strategies used previously in the 
literature. Numerical solutions to certain example problems 
were indicated and the analysis of the solutions led to the 
following interesting avenues for future investigation. 

The special form of the nonlinear dynamics in the system 
under consideration, leads to a degenerate Hamilton-Jacobi- 
Bellman equation associated with the optimal control prob- 
lem - this necessitates a weak (viscosity) solution interpre- 
tation for the solution to these control problems. Hence this 
remains to be investigated in greater detail. Another aspect of 
note in this problem is the fact that the optimal measurement 
and control strategy are not separable]^ i.e., the observation 
strategy depends on a knowledge of the angular velocity con- 
trol. Certain aspects of this problem appear to parallel those 
in the well known Witsenhausen counterexample [27], [28], 
[29]. Inspired by these ideas, we note that in the quantum 
control problem, one possible viewpoint is to analyze this 
problem as a combination of two interrelated controllers (i) 
rotational velocity control; (ii) measurement control, with an 
overall time optimal strategy depending on two controllers 
in a distributed control paradigm. In this arrangement, the 
knowledge of the state from the first controller is passed to 
the second controller This potentially gives rise to a non- 
classical information pattern. The analysis of the impact of 
this on optimal control and measurement and its relationship 
to the foundations of decentralized control also offers a 
fruitful topic for future work. 
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